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Abstract In this paper we construct constant dimension space codes with pre- 
scribed minimum distance. There is an increased interest in space codes since a 
paper [13] by Kotter and Kschischang were they gave an application in network 
coding. There is also a connection to the theory of designs over finite fields. We 
will modify a method of Braun, Kerber and Laue [7] which they used for the con- 
L^ , struction of designs over finite fields to do the construction of space codes. Using 

this approach we found many new constant dimension spaces codes with a larger 

ryj \ number of codewords than previously known codes. We will finally give a table 

O ■ of the best found constant dimension space codes. 



network coding, q-analogue of Steiner systems, subspace codes 
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1.1 Subspace Codes 

In [13] R. Kotter and F. R. Kschischang developed the theory of subspace codes for ap- 
plications in network coding. We will recapitulate their definitions in a slightly different 
manner. We denote by L(GF(q) v ) the lattice of all subspacesof the space of dimension 
v over the finite field with q elements. A subspace code C is a subset of L(GF(q) v ). If 
all the subspaces in C are of the same dimension then C is a constant dimension code. 

The subspace distance between two spaces V and W in L(GF(q) v ) is defined as 
d s (V, W) := dim(V + W)- dim{V n W) 



which is equal to 



dim(V) + dim(W) - 2dim{V n W). 



This defines a metric on L(GF(q) v ). The minimum (subspace) distance of a subspace 
code C is defined as 

D S (C) ■- min{d s (V, W):V,W G C and V ^ W}. 

We define now the optimal (subspace) code problem: 

(PI) For a given lattice L(GF(q) v ) fix a minimum (subspace) distance d 
and find the maximal number m of subspaces V\ , . . . , V m in L(GF(q) v ) such 
that the corresponding subspace code C = {Vi, . . . , V m } has at least minimum 
distance d. 

The following point of view is useful for the study of subspace codes: One of the clas- 
sical problems in coding theory can be stated as follows: 

(P2) Given the Hamming graph of all words of length v and a minimum 
distance d find a maximal number m of words such that the pairwise minimum 
distance is at least d. 

If we substitute the Hamming graph by the Hasse diagram of L(GF(q) v ) the problem 
(P2) becomes problem (PI). Both problems are special cases of a packing problem in 
a graph. If we start with problem (P2) and use the 'field with one element' we get the 
problem (PI). Because of this property we say (P2) is the q— analogue of (PI). This 
connection is well known (e.g. [1,17]) and will be useful in the following. Since the 
publication of the paper by Kotter and Kschischang the constant dimension codes as the 
q— analogue of the constant weight codes were studied in a series of papers [12,10,23]. 

1 .2 q —Analogues of Designs 

A t — (v, k, A) design is a set C of fc— element subsets (called blocks) of the set 
{1, . . . , v} such that each t— element subset of {1, . . . , v} appears in exactly A blocks. 
The special case of A = 1 is called a Steiner system. 

The same construction which was used to connect problem (PI) to (P2) in the subsection 
above can be used to define the q— analogue of a t— design. A t — (v, k, A) design over 
the finite field GF(q) is a multiset C of fc— dimensional subspaces (called q— blocks) 
of the v— dimensional vector space GF(q) v such that each t— dimensional subspace of 
GF(q) v is subspace of exactly A q— blocks. 

The connection with the constant dimension codes is given by the following observation 
in the case of a q— analogue of a Steiner system: Given a g— analogue of a t — (v, k, 1) 
design C we get a constant dimension code of minimum distance 2(fc — 1+ 1). As each 
t— dimensional space is in only one fc— dimensional subspace the intersection between 
two spaces from C is at most (t — 1)— dimensional. Therefore the minimum distance of 
Cis at least 2(fc — 1+ 1). On the other hand given any (t— 1)— dimensional subspace V 



we can find two t— dimensional spaces U, W with intersection V and then two unique 
q— blocks containing U and W. The minimum distance between these q— blocks is 2(fc— 
t+1). 

q— analogues of designs were introduced by Thomas in 1987 [19]. Later they were 
studied in a paper by Braun et al. [7] were the authors constructed the first non-trivial 
q— analogue of a 3— design. We will use the methods described in their paper to con- 
struct constant dimension space codes. 

In later papers by Thomas [20] and Etzion and Schwartz [17] it was shown that there 
are severe restrictions on the possible existence of g— analogues of Steiner systems. We 
will search for a collection of subspaces satisfying only the conditions given by (PI) 
and not for the stronger condition satisfied by a q— analogue of a Steiner system. In 
general this method could also be used for the search for Steiner systems. 



2 Construction of Constant Dimension Codes 



In this section we describe how to construct a constant dimension code C using a system 
of Diophantine linear equations and inequalities. Due to the definition of the subspace 
distance for all V 7 W E C we have ds(V, W) = 2k - 2dim(V fl W) where fc is the 
dimension of the code. Thus the minimum subspace distance has to be an even number 
less or equal to 2k. To construct a constant dimension subspace code of dimension k and 
minimum subspace distance 2d we have to find n subspaces {Vi, . . . ,V n } of dimension 
k such that there is no subspace of dimension k — d + 1 contained in two of the selected 
fc— spaces. We define M as the incidence matrix of the incidence system between the 
(k — d+ 1)— spaces (labeling the rows of M) and the fc— spaces (labeling the columns): 

J 1 if V contains W, 

' ' [ otherwise. 

Using M we get the description of a constant dimension code as the solution of a 
Diophantine system. We denote by s the number of columns of M. 

Theorem 1. 

There is a constant dimension code with m codewords and minimum distance at least 
2d if and only if there is a (0 /l)— solution x — (x\, . . . , x s ) T of the following system 
of one equation and one set of inequalities: 

(1) x\ + . . . + x s = m, 

(2) 

Mx < 



This set of equation has to be read as follows: A solution x has the property that the 
product of x with a single row of M is or 1. To get the constant dimension code 
corresponding to a solution we have to use the (0/1)— vector x as the characteristic 
vector of a subset of the set of all fc— dimensional subspaces of GF(q) v . Theorem 1 is 
a generalization of the Diophantine system describing the search for a q— analogue of a 
Steiner system which was given in [7]. 

Corollary 1. [7] 

There is a q— analogue of a (k — d + 1) — (v, fc, 1) design with b blocks if and only if 
there is a (0/1)— solution x = (x\, . . . , x s ) T of the following system of Diophantine 
linear equations: 

(1) X\ + . . . + x s = b, 

(2) 

Mx = 



The size of these problems is given by the number of subspaces in GF(q) v . In general 
this number is growing too fast. The number of fc— dimensional subspaces of GF(q) v 
is given by the q— binomial coefficients: 



n 



(l-qv+i-i) 



(1 
j=i..k y 



Already in the smallest case of a q— analogue of the Fano plane for q = 2 the matrix M 
has 11811 columns and 2667 rows. 



3 Constant Dimension Codes with prescribed Automorphisms 

To handle also larger cases we apply the following method. We no longer look for 
an arbitrary constant dimension code. We are now only interested in a set of spaces 
which have a prescribed group of automorphisms. An automorphism ip of set C — 
{Vi, . . . , V m } is an element from GL(v, GF(q)) such that C = {ip(V\), . . . , ip(V m )}. 
We denote by G the group of automorphisms of C, which is a subgroup of GL(v, GF(q)). 

The main advantage of prescribing automorphisms is that the size of the system of 
equations is much smaller. The number of variables will be the number of orbits of G 
on the fc— spaces. The number of equations or inequalities will be the number of orbits 
on the (k — d + 1)— spaces. The construction process will then have two steps: 

- In a first step the solution of a construction problem is described as a solution of a 
Diophantine system of linear equations. 



- In a second step the size of the linear system is reduced by prescribing automor- 
phisms. 

This construction method is a general approach that works for many discrete structures 
as designs [14,3], q-analogs of designs [6,7], arcs in projective geometries [8], linear 
codes [2,4,5,15] or quantum codes [21]. 

The general method is as follows: The matrix M is reduced by adding up columns 
(labeled by the fc— spaces) corresponding to the orbits of G. Now because of the relation 

Vt-^ subspace of V ^==^ <p{W) subspace of <p(V) (1) 

for any A:— space V and (fc — d)— space W and any automorphism <p G G the rows 
corresponding to lines in an orbit under G are equal. Therefore the redundant rows are 
removed from the system of equations and we get a smaller matrix denoted by M G . The 
number of rows of M G is then the number of orbits of G on the (fc — rf+1)— spaces. The 
number of columns of M G is the number of orbits of G on the fc— spaces. We denote by 
oji, . . . the orbits on the fc— spaces and by fi\, . . . the orbits on the (fc — d+ 1)— spaces. 
For an entry of M G we have: 

M n it v = \i v eujj :W is a subspace of V}\ 

where W is a representative of the orbit Qi of (fc — d+ 1)— spaces. Because of property 
(1) the matrix M is well-defined as the definition of M G , w is independent of the rep- 
resentative W. Now we can restate the above theorem in a version with the condensed 
matrix M G : 

Theorem 2. 

Let G be a subgroup of GL{v, GF{q)). There is a constant dimension code of length 
m and minimum distance at least 2d whose group of automorphisms contains G as 
a subgroup if and only if there is a (0/1)— solution x = (x\, . . .) T of the following 
system of equations: 

(1) |wi| Xi + . . . = m, 

(2) 




There is one further reduction possible. We are looking for a (0/1)— solution where 
each inner product of a row of M G and the vector x is less or equal to 1. We can 
remove columns of M G with entries greater than 1. This gives a further reduction of 
the size of M G . After this last removal of columns we again check on equal rows and 
also on rows containing only entries equal to zero. These all zero rows and all but one 
copy of equal rows are also removed. 



In order to locate large constant dimension codes with given parameters q, k and 2d we 
try do find feasible solutions x = (xi, . . .) T of the system of equations of Theorem 3 
for a suitable chosen group G and a suitable chosen length m. Here we remark that we 
have the freedom to change equality (1) into 



y^ \u>i\xi > m. 



For this final step we use some software. Currently we use a variant of an LLL based 
solver written by Alfred Wassermann [22] or a program by Johannes Zwanzger [24] 
which uses some heuristics especially developed for applications in coding theory. The 
advantage of the LLL based solver is that we definitely know whether there exist feasi- 
ble solutions or not whenever the program runs long enough to terminate. Unfortunately 
for the examples of Section 5 this never happens so that practically we could only use 
this solver as a heuristic to quickly find feasible solutions. 

If we change equality (1) into a target function 

fix) = /(Xi,...) = \iVi\Xi 

we obtain a formulation as a binary linear optimization problem. In this case we can 
apply the commercial ILOG CPLEX 11.1.0 software for integer linear programs. The 
big advantage of this approach is that at every time of the solution process we have 
lower bounds, corresponding to a feasible solution with the largest /(x) -value found so 
far, and upper bounds on f(x). 

We can even reformulate our optimization problem in the language of graph theory. 
Here we consider the variable indices i as vertices of a graph Q each having weight 
\uji\. The edges of Q are implicitely given by inequality (2). Therefore let us denote the 
i's row of M G by Mf.. Now the inequality Mf. < 1 translates into the condition that 
the set 

C t := {j : Mg - 1} 

is an independence set in Q. To construct the graph Q we start with a complete graph and 
for each row Mf. we delete all edges between vertices in Cj. Now an optimal solution 
of the binary linear program corresponds to a maximum weight clique in Q. Again there 
exist heuristics and exact algorithms to determine maximum weight cliques in graphs. 
An available software package for this purpose is e.g. CLIQUER [16]. 

This approach allows to use the whole bunch of clique bounds from algebraic graph 
theory to obtain upper bounds on the target function /(x). In the case where we are able 
to locate large independent sets in Q which are not subsets of the Cj we can use them 
to add further inequalities to (2). If those independent sets are large enough and not too 
many then a solver for integer linear programs highly benefits from the corresponding 
additional inequalities. 

For theoretical upper bounds and practical reasons how to quickly or exhaustively locate 
solutions of our system of Theorem 2 it is very useful to have different formulations of 
our problem to be able to apply different solvers. 



3.1 Example 



We start with the space GF(2) 7 . We now describe the construction of a subspace code 
with 304 codewords and constant dimension equal to 3. This code will have minimum 
subspace distance 4. The matrix M is the incidence matrix between the 3— dimensional 
subspaces of GF(2) 7 and the 2— dimensional subspaces. Without further reductions 



this matrix has 



= 11811 columns and 



= 2667 rows. We prescribe now a 



group G of automorphisms generated by a single element: 



(I 



G:= 



\ 



V 



1 

II 11 
11 11 

III 11/ 



This group G has 567 orbits on the 3— spaces and 129 orbits on the 2— spaces. Using 
Theorem 3 we can formulate the search for a large constant dimension code as a binary 
linear maximization problem having 129 constraints and 567 binary variables. After a 
presolving step, automatically performed by the ILOG CPLEX software, there remain 
only 477 binary variables and 126 constraints with 3306 nonzero coefficients. 

After some minutes the software founds a (0/1)— solution with 16 variables equal to 
one. Taking the union of the corresponding 16 orbits on the 3— spaces of GF(2) 7 we 
got a constant dimension space code with 304 codewords having minimum distance 4. 
Previously known was a code with 289 codewords obtained from a construction using 
rank-metric codes ([18] p. 28) and another code consisting of 294 subspaces discovered 
by A. Vardy (private email communication). 

In general it is difficult to construct the condensed matrix M G for an arbitrary group and 

larger parameters v and k as the number of subspaces given by the Gaussian polynomial 

v 

, grows very fast and it becomes difficult to compute all the orbits necessary for the 

-I <? 

computation of M G . In the following section we give a method to get a similar matrix 
in special cases. 



4 Using Singer Cycles 



A special case of the above method is the use of a Singer cycle. We use for the re- 
duction a Singer subgroup of GL(v, q) which acts transitively on the one-dimensional 
subspaces of GF(q) v . Singer cycles have been used in many cases for the construction 
of interesting geometric objects [9]. We will now describe a method to construct a set 
C of fc— subspaces of GF(q) v with the following two special properties: 



1. C has the Singer subgroup as a subgroup of its group of automorphisms. 

2. The dimension of the intersection of two spaces from C is at most one. 

Such a set C of course is a constant dimension subspace code of minimum distance 

2(fc — 1). This is a special of the situation of Theorem 3. We now fix one generator g E 

GL(v, q) of a Singer subgroup G and a one-dimensional subspace V of GF(q) v . As G 

acts transitively on the one-dimensional subspaces we can label any one-dimensional 

v 
subspace W by the unique exponent i between and I := , — 1 with the property 

[fcj 2 

that W = g l V. Given a fc— spaces U we can describe it by the set of one-dimensional 

(i.e. numbers between and I) subspaces contained in U. Given such a description 

of a fc— spaces it is now easy to get all the spaces building the orbit under the Singer 

subgroup G. You simply have to add one to each number and you will get the complete 

orbit if you do this I times. 

Example 1. q = 2, v = 5, fc = 2 

A two-dimensional binary subspace contains three one-dimensional subspaces. We get 
a two-dimensional space by taking the two one-dimensional spaces labeled {0, 1} and 
the third one given by the linear combination of these two will have a certain num- 
ber in this example {14}. Therefore we have a two dimensional space described by 
the three numbers {0, 1, 14}. Two get the complete orbit under the Singer subgroup 
we simply has to increase the numbers by one for each multiplication by the gen- 
erator g of the Singer subgroup. The orbit length of the Singer subgroup is 31 and 
the orbit is build by the 31 sets: {0, 1, 14}, {1, 2, 15}, . . . , {16, 17, 30}, {0, 17, 18}, . . . 
{12, 29, 30}, {0,13, 30}. 

To describe the different orbits of the Singer subgroup we build the following set of 
pairwise distances: 

Denote by s the number of one-dimensional subspaces in fc— space. Let {v\ , . . . , v s } C 
{0, 1, . . . , 1} be the set of s numbers describing a fc— space U. Denote by d^^x the 
distance between the two numbers Vi and Vj modulo the length of the Singer cycle. 
dujx is a number between 1 and 1/2. We define the multiset Djj := {dujx : 1 < i < 
j < s}. We call Djj the distance distribution of the subspace U. All the spaces in an 
orbit of a Singer subgroup have the same distance distribution and on the other hand 
different orbits have different distance distribution. We therefore also say that Djj is the 
distance distribution of the orbit. 

We use these distance distribution to label the different orbits of the Singer subgroup of 
the fc— spaces. The first observation is: 

Lemma 1. A Singer orbit as a subspace code 

An orbit C = {Vo, . . . , Vi} of Singer subgroup on the fc— subspaces of GF(q) v is a 
subspace code of minimum distance 2(fc — 1) if and only if the distance distribution of 
the orbit has no repeated numbers. 



Proof. We have to show that the intersection of any pair of spaces in C has at most one 
one-dimensional space in common. Having no repeated entry in the distance distribu- 
tion means that a pair of numbers (i.e. pair of one-dimensional subspaces) in a q— block 
b of C can not be built again by shifting the numbers in b using the operation of the 
Singer subgroup on b. 

The same is true if we want to construct a subspace code by combining several orbits 
of the Singer subgroup. We have to check that the intersection between two spaces is at 
most one-dimensional for this we define the matrix S, whose columns are labeled by the 
orbits Qj of the Singer subgroup on the fc— dimensional subspaces of GF(q) v and the 
rows are labeled by the possible number i £ {0, . . . , 1/2} in the distance distribution of 
the fc— spaces. Denote by Dq. the distance distribution of the j— th orbit then we define 
an entry of the matrix S by 

_ f 1 if i E D i2] 
l '^ " \ otherwise. ' 

Using this matrix S we have the following characterization of constant dimension codes 
with prescribed automorphisms: 

Theorem 3. 

There is a constant dimension code C with n ■ (I + 1) codewords and minimum distance 
at least 2(fc — 1) whose group of automorphisms contains the Singer subgroup as a 
subgroup if and only if there is a (0/1)— solution x = (xi, . . ) T of the following 
system of equations: 

(1) Sxi = n, 

(2) 




This is the final system of one Diophantine linear equation together with 1/2 + 1 in- 
equalities which we successfully solved in several cases. 



5 Results 

As mentioned in the introduction there is an increased interest on constant dimension 
codes with a large number of codewords for a given minimum subspace distance. There 
are (very) recent ArXi V-preprints [10,11,18] giving some constructions for those codes . 

Here we restrict ourselves on the binary field q = 2 and dimension fc = 3 and minimum 
subspace distance d$ = 4. 



Using the approach described in Section 4 it was possible to construct constant dimen- 
sion codes using the Singer cycle with the following parameters. We denote by n the 
number of orbits used to build a solution, by d we denote minimum space distance of 
the corresponding constant dimension space code: 



V 


k 


n 


■#■ orbits 


■#■ codewords 


best 


d 


G 


3 


1 


19 


1 • 63 = 63 


71[18] 


4 


7 


3 


2 


93 


2-127 = 254 


294 


4 


8 


3 


5 


381 


5-255 = 1275* 


1164[18] 


4 


9 


3 


11 


1542 


11-511 = 5621* 


4657[18] 


4 


10 


3 


21 


6205 


21 • 1023 = 21483* 


18631[18] 


4 


11 


3 


39 


24893 


39 • 2047 = 79833* 


74531 [18] 


4 


12 


3 


77 


99718 


77-4095 = 315315* 


298139[18] 


4 


13 


3 


141 


399165 


141-8191 = 1154931 


1192587[18] 


4 


14 


3 


255 


1597245 


255 • 16383 = 4177665 


4770411[18] 


4 



In [11] the authors defined the number A q (v,d, k) as the maximal number of codewords 
in a constant dimension subspace code of minimum distance d. They derived lower 
and upper bounds. We have implemented the construction method described in [18] to 
obtain the resulting code sizes which give the lower bounds for A q (v, d, k) for v > 9. 
In the above table we marked codes which improved the lower bounds on A q (v, d, k) 
with an *. We would like to remark that for 6 < v < 8 our results are optimal for the 
Singer cycle as a subgroup of the group of automorphisms. So far for v = 9 a code 
size of n = 12 is theoretically possible. (In this case the corresponding binary linear 
program was not solved to optimality.) 

Since for v = 6, 7 the method using the Singer cycle was not capable of beating the 
best known constant dimension code we tried the more general approach described in 
Section 3. 

For v = 6 even the original incidence matrix M or M G where G is the identity group 
results in only 1395 binary variables and 651 constraints having 9765 nonzero entries. 
Using the ILOG CPLEX 11.1.0 solver directly on this problem yields a constant dimen- 
sion code of cardinality n = 74 which beats the example of [10,18] by 3. The upper 
bound in this case is given by 93. 

As mentioned in Example 3.1 the original incidence matrix M is quite large. Here the 
direct approach has not lead to any improvements. Although in general it is difficult to 
construct the condensed matrix M G for an arbitrary group and larger parameters we 
were able to conquer the difficulties for v = 7, k = 3, d$ = 4 and some groups. The 
group resulting in the code having 304 three-dimensional subspaces of GF(2) 7 such 
that the intersection of two codewords has dimension at most one was already given in 
Example 3.1. We have tried several groups before ending up with this specific group. 
More details can be shown using the following diagram: 




order 3 

43,903,3951 

304-381 





order 3 






order 2 




order? 




order 127 


Singer 




47,897,3961 




95,1675,6851 




19,381,1695 




1,21,93 






263-381 




93-105 




304-381 




254 






identity 








127,2667,11811 














304-381 















This picture shows part of the subgroup lattice of the automorphism group PGL(7, 2) 
of the L(GF(2) 7 ). It only shows cyclic groups and in the top row we give the order 
of the group. In the second row we give the number of orbits on the points, lines and 
planes. In the third row of each entry we give the size lb of a subspace code and the 
best found upper bound ub in the format lb — ub. As described in Section 3 for a given 
group our problem corresponds to several versions of feasibility or optimization prob- 
lems. To obtain the lower bounds we have used the LLL based algorithm, the coding 
theoretic motivated heuristic and the ILOG CPLEX solver for integer linear programs. 
The upper bounds were obtained by the CPLEX solver stopping the solution process 
after a reasonable time. Whenever the lower and the upper bound meet we have written 
only one number in bold face. In each of these cases we give the necessary computation 
time to prove optimality in the forth row. 

As we can split orbits if we move to a subgroup we can translate a solution found for a 
group G into a solution for a subgroup of G. E.g. for the groups of order smaller than 
21 we did not find codes of size 304 directly. This fact enables us to perform a restricted 
search in systems corresponding to subgroups by only considering solutions which are 
in some sense near to such a translated solution. We have tried this for the subgroups 
of the group of order 21 - unfortunately without success. 

We would like to remark that solving the linear relaxation can prevent other heuristics 
from searching for good solutions where no good solutions can exist. E.g. we can calcu- 
late in a second that every code in the case of the third group in the third row can contain 
at most 106 codewords. Since we know better examples we can skip calculations in this 
group and all groups which do contain this group as a subgroup. 

Finally we draw the conclusion that following the approach described in Section 3 
it is indeed possible to construct good constant dimension codes for given minimum 
subspace distance. Prescribing the Singer cycle as a subgroup of the automorphism 
group has some computational advantages. The resulting codes are quite competitive 



for v > 8. The discovered constant dimension codes for v — 6, 7 show that it pays off 
to put some effort in the calculation of the condensed matrix M G for other groups. 
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